ftp.cs.arizona.edu

home *** CD-ROM | disk | FTP | other *** search

/ ftp.cs.arizona.edu / ftp.cs.arizona.edu.tar / ftp.cs.arizona.edu / icon / newsgrp / group98c.txt / 000021_icon-group-sender _Mon Sep 14 08:23:43 1998.msg < prev next >

Wrap

Internet Message Format | 2000-09-20 | 2KB

Return-Path: <icon-group-sender> Received: from kingfisher.CS.Arizona.EDU (kingfisher.CS.Arizona.EDU [192.12.69.239]) by baskerville.CS.Arizona.EDU (8.9.1a/8.9.1) with SMTP id IAA06280 for <icon-group-addresses@baskerville.CS.Arizona.EDU>; Mon, 14 Sep 1998 08:23:42 -0700 (MST) Received: by kingfisher.CS.Arizona.EDU (5.65v4.0/1.1.8.2/08Nov94-0446PM) id AA01577; Mon, 14 Sep 1998 08:23:15 -0700 To: icon-group@optima.CS.Arizona.EDU Date: 12 Sep 1998 07:30:31 GMT From: MUSKB@luff.latrobe.edu.au (Bastin,kim) Message-Id: <6td7un$cpr$2@news.latrobe.edu.au> Organization: La Trobe University Sender: icon-group-request@optima.CS.Arizona.EDU References: <4FD6422BE942D111908D00805F3158DF0757B57E@RED-MSG-52> Subject: Re: Unicode support or support for non-ASCII based character ma Errors-To: icon-group-errors@optima.CS.Arizona.EDU Status: RO Todd Proebsting (toddpro@microsoft.com) wrote: : Jcon (http://www.cs.arizona.edu/icon/jcon) originally used Java's Unicode : strings as the basis for implementing Icon strings. This was trivial to do, : and we (Gregg Townsend and I) would have kept that implementation except : that it was much too slow. I blame this in large part on the Java "String" : implementation, and not on Unicode. (Although Unicode obviously wastes a : lot of space in applications that are just using the ASCII subset.) : Prior to switching to ASCII, we never faced the problem of what was the : "right" thing to do with pre-defined csets with Unicode. For instance, what : are the elements of the cset &ucase in an Unicode world? : Todd Proebsting Icon's &ucase (A-Z) is based on the English alphabet. Other European languages would add @ADCG... etc. But it doesn't stop there. Whether you have upper vs lowercase at all depends on the alphabet. Not all alphabets have such a distinction. Some have other distinctions that are novel to users of the Roman alphabet. For example, Greek, Hebrew and Arabic have special 'final' forms of some characters (used at the end of a word). Kim Bastin